Proposal: Gateway API Inference Extension #3800
this is really well written :)
@shaun-nx @salonichf5 Added a diagram, let me know if that helps or if there's anything missing.
lgtm, nice work
Nice work! 🚀
Really great work @sjberman! 🎉
Add the design for supporting the Gateway API Inference Extension. This would allow NGF to configure NGINX to route traffic to AI workloads in Kubernetes, using specialized load-balancing.
Closes #3716
Checklist
Before creating a PR, run through this checklist and mark each as complete.
Release notes
If this PR introduces a change that affects users and needs to be mentioned in the release notes,
please add a brief note that summarizes the change.